
support openai compatible model reasoning content in streaming response#13

Merged
sjy3 merged 2 commits into volcengine:main from UnderTreeTech:main
Jan 4, 2026

Conversation

@UnderTreeTech
Contributor

@UnderTreeTech UnderTreeTech commented Jan 3, 2026

Support reasoning content in the streaming response, so that users can register a custom AfterModelCallback to filter out thought content during streaming LLM calls.
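As a rough sketch of the mapping this PR enables (the `StreamDelta` and `Part` types below are illustrative stand-ins, not the actual SDK definitions): an OpenAI-compatible streaming delta that carries `reasoning_content` is surfaced as a part with `Thought` set to true, so downstream callbacks can distinguish thoughts from the final answer.

```go
package main

import "fmt"

// StreamDelta mimics the delta object of an OpenAI-compatible streaming
// chunk; ReasoningContent carries the model's thinking output.
// (Illustrative type, not the actual SDK definition.)
type StreamDelta struct {
	Content          string `json:"content"`
	ReasoningContent string `json:"reasoning_content"`
}

// Part mirrors the Thought flag on genai.Part for this sketch.
type Part struct {
	Text    string
	Thought bool
}

// deltaToParts converts a streaming delta into response parts, marking
// reasoning content with Thought=true so callbacks can filter it later.
func deltaToParts(d StreamDelta) []Part {
	var parts []Part
	if d.ReasoningContent != "" {
		parts = append(parts, Part{Text: d.ReasoningContent, Thought: true})
	}
	if d.Content != "" {
		parts = append(parts, Part{Text: d.Content, Thought: false})
	}
	return parts
}

func main() {
	parts := deltaToParts(StreamDelta{ReasoningContent: "thinking...", Content: "Hello"})
	for _, p := range parts {
		fmt.Printf("thought=%v text=%q\n", p.Thought, p.Text)
	}
}
```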

@UnderTreeTech UnderTreeTech changed the title from "support reasoning content in streaming response" to "support openai compatible model reasoning content in streaming response" Jan 4, 2026
@sjy3 sjy3 merged commit d742344 into volcengine:main Jan 4, 2026
2 checks passed
@sjy3
Collaborator

sjy3 commented Jan 4, 2026


When thinking mode is activated, the web page generates multiple messages. If you are willing to fix this issue, please submit a follow-up pull request (PR).

@UnderTreeTech
Contributor Author

@sjy3 We can add a standard AfterModelCallback implementation that filters out thought content, which allows flexible control over thought visibility:

func ThoughtFilterCallback(ctx agent.CallbackContext, llmResponse *model.LLMResponse, llmResponseError error) (*model.LLMResponse, error) {
    // Returning (nil, nil) leaves the original response unmodified.
    if llmResponseError != nil || llmResponse == nil || llmResponse.Content == nil {
        return nil, nil
    }

    var filteredParts []*genai.Part
    hasThought := false

    // Keep only the parts that are not flagged as thoughts.
    for _, part := range llmResponse.Content.Parts {
        if !part.Thought {
            filteredParts = append(filteredParts, part)
        } else {
            hasThought = true
        }
    }

    if hasThought {
        // Shallow-copy the response so the original is not mutated.
        newResponse := *llmResponse
        newResponse.Content = &genai.Content{
            Role:  llmResponse.Content.Role,
            Parts: filteredParts,
        }
        return &newResponse, nil
    }

    return nil, nil
}

Usage:

agent, err := llmagent.New(llmagent.Config{
    Name: "MyAgent",
    Model: model,
    AfterModelCallbacks: []llmagent.AfterModelCallback{
        ThoughtFilterCallback,  // Filter thought content
    },
})

Benefits

  • Flexibility: Can be enabled or disabled per agent by including or excluding the callback
  • Consistency: Provides a standard approach across different OpenAI-compatible providers
  • No Breaking Changes: Fully opt-in via the existing callback mechanism
  • Frontend/Backend Choice: The decision to filter can be made at either layer
    1. Backend filtering: Register the callback to strip thought content before it is sent to the frontend
    2. Frontend filtering: Don't register the callback; let the frontend decide whether to display thought content based on the part.Thought flag
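The frontend-filtering option could look like the following sketch (the `Part` type and `renderVisible` helper are hypothetical; in practice the frontend would inspect the `Thought` flag on each `genai.Part`):

```go
package main

import "fmt"

// Part stands in for genai.Part in this sketch.
type Part struct {
	Text    string
	Thought bool
}

// renderVisible returns only the text the frontend chooses to show,
// skipping parts flagged as thoughts unless showThoughts is set.
func renderVisible(parts []Part, showThoughts bool) []string {
	var out []string
	for _, p := range parts {
		if p.Thought && !showThoughts {
			continue
		}
		out = append(out, p.Text)
	}
	return out
}

func main() {
	parts := []Part{
		{Text: "reasoning...", Thought: true},
		{Text: "answer", Thought: false},
	}
	fmt.Println(renderVisible(parts, false)) // thoughts hidden
	fmt.Println(renderVisible(parts, true))  // thoughts shown
}
```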

